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ABSTRACT 

We discuss the theoretical interpretation of observational data concerning the 
clustering of galaxies at high redshifts. Building on the theoretical machinery 
developed by Matarrese et al. (1997), we make detailed quantitative predic- 
tions of galaxy clustering statistics for a variety of cosmological models, taking 
into account differences in spatial geometry and initial fluctuation spectra and 
exploring the role of bias as a complicating factor in these calculations. We 
demonstrate that the usual description of evolution (in terms of the parame- 
ters e and ro) is not useful for realistic galaxy clustering models. We compare 
the detailed predictions of the variation of correlation functions with redshift 
against current observational data to constrain available models of structure 
formation. Theories that fit the present-day abundance of rich clusters are 
generally compatible with the observed redshift evolution of galaxy clustering 
if galaxies are no more than slightly biased at z ~ 1 . We also discuss the inter- 
pretation of a concentration of Lyman-break galaxies found by Steidel et al. 
(1998), coming to the conclusion that such concentrations are not unexpected 
in 'standard' models of structure formation. 

Key words: cosmology: theory - cosmology: observations - large-scale struc- 
ture of Universe - galaxies: formation - galaxies: evolution - galaxies: haloes 
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1 INTRODUCTION 

The direct observation of statistically complete samples of galaxies at high redshift has only recently become techno- 
logically feasible, but it promises to yield important information about the origin and evolution of cosmic structures. 
In particular, the possibility to observe "normal" galaxies (or their progenitors), rather than just quasars or other 
ultraluminous sources, may lead to an understanding of how these objects relate to the distribution of the dark matter 
which is assumed, in most theories, to dominate the density of the Universe. This, in turn, should allow observations 
of galaxy clustering to be used to gain insights about fundamental aspects of cosmological models, as well as learning 
about the way galaxies themselves evolve. But in this field, theory currently lags considerably behind observations, 
with the result that experimental data are sometimes placed in a naive, or perhaps simply incorrect, theoretical 
context. 

In Matarrese et al. (1997; hereafter Paper I), we discussed high-redshift clustering phenomena from a theoretical 
perspective. In particular, we developed a general formalism which one can use to make detailed predictions of 
statistical measures of clustering and which also makes explicit the main sources of theoretical uncertainty in these 
predictions. This formalism allows one to make a realistic assessment of how models of structure formation fare in 
the face of results from particular observational programmes. This formalism is considerably more complicated than 
the usual simple scaling ansatz which forms the framework within which most observational results have previously 
been interpreted. 

It is the main purpose of this paper to deploy the techniques of Paper I in a systematic comparison of currently 
popular structure formation models with the available observational data. The models are all based on the cold dark 
matter (CDM) model, but vary in the amount of dark matter, the initial perturbation spectrum, the background 
cosmology and in the presence or absence of a cosmological constant. These variations have been introduced in an 
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2 Moscardini et al. 

attempt to reconcile the basic idea of CDM with cosmological observations which rule out the simplest version of the 
model; see, e.g. Coles (1996). This detailed comparison requires us to specify the assumptions entering our calculations 
very carefully, so we also take the opportunity to refine the general arguments we made in Paper I. 

The layout of the paper is as follows. In Section 2, we briefly recap the main elements of the approach outlined in 
Paper I and explain the extension to, for example, the case of an open universe or models with cosmological constant. 
We also describe the six models of structure formation we use to compare with the data. We explore the issue of bias 
in Section 3, drawing on some themes that were introduced in Paper I, but focussing on the specific case of galaxy 
formation in hierarchical clustering models. Section 4 contains a critique of some simple theoretical arguments often 
presented in the literature. The detailed comparison of clustering observations with our theoretical predictions is 
made in Section 5, where the relevant data are also described. We also comment on the theoretical interpretation of 
a concentration of galaxies found at redshift ~ 3 by Steidel et al. (1998). We present our main conclusions in Section 
6. 



2 MODELLING THE EVOLUTION OF CLUSTERING 

2.1 Preliminaries 

In order to model the clustering of high-redshift galaxies, and to compare observational data with the predictions 
of different cosmological models, one must confront a number of different issues. Firstly, there is the necessity to 
have a reliable way to follow the redshift evolution of matter correlations. Secondly, one has to model the relationship 
between fluctuations in the mass and fluctuations in observable galaxies, and to consider the possible time evolution of 
this relationship. In the usual language, this means devising a model for the bias. Another ingredient is the calculation 
of effects introduced by the observing process, such as the role of the selection function, and the possible effect of 
redshift-space distortions on measures of clustering. Finally, because of the limited number of galaxies in available 
samples and the need to obtain a reasonable statistical signal-to-noise ratio, observational results are usually presented 
for galaxies spanning a range of redshifts. This last effect means, for example, that the observed correlation function 
of the sample involves a convolution of the real correlation function with the redshift distribution of the objects 
contained in the sample. 

We should also point out that gravitational lensing effects along the observer's past light cone also introduce a bias 
into the observed statistics (e.g. Villumsen 1996; Moessner, Jain & Villumsen 1998). The presence of a correlated shear 
results in increased apparent clustering over and above that produced by the intrinsic galaxy correlations. Likewise 
the distortion introduced by inferring galaxy positions from redshifts also acts to increase clustering statistics with 
respect to the real-space versions, except for projected correlations which do not suffer from this effect. Since these 
effects both act in the same direction, calculations made without taking them into account are not exactly comparable 
with observations: more realistic predictions, however, would always be higher than those we present here. 

In Paper I we developed a formalism that takes into account all these requirements, and most of this section is 
devoted to a brief summary of the machinery that was constructed there. Essentially, Paper I showed that the observed 
correlation function £ b s in a given redshift interval Z is an appropriate weighted average of the mass autocorrelation 
function £ with the mean number of objects N and effective bias factor 6 e ff, defined below in equation (H), in that 
range: 

Cobs(r) = N~ 2 \ d Zl dz 2 Af(zi) Af(z 2 ) b cB ( Zl ) b cB {z 2 ) £(r,z) , (1) 



where N = J dz'Af(z') and z is an intermediate redshift between z\ and z 2 . Porciani (1997), by computing the 
evolution of the two-point correlation function in the Zel'dovich (1970) approximation, showed that adopting the 
relation D+{z) = \fD+ (z\ )D+ (22) , where D+ is the growth law for linear density fluctuations (see below, Section 
2.4), ensures that predictions will be accurate to an error smaller than 1 per cent. In the following we will assume 
this expression to define z. 

2.2 Bias and all that 

The factor of b cB which appears in equation (uh is a consequence of our lack of understanding of the details of the 
galaxy formation process and the consequently uncertain relationship between fluctuations in matter density <5 m and 
galaxy number-density 8 n . It is conventional to parametrise one's ignorance in this arena by introducing a single 
linear bias parameter such that a relationship of the form 8 n = b 5 m is assumed. In this work we shall generalise this 
idea so that we assume that objects with given intrinsic properties (such as mass M) and at different redshifts z can 
have different bias parameters, which we call b(M, z). For each set of objects, however, the bias is assumed still to be 
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Modelling galaxy clustering at high redshift 3 

linear; it is also local, in the sense that the propensity of galaxies to form at a given spatial location x depends only 
on the matter density at that point: 

5 n (x;M,z)~b(M,z)6m(xL,z); (2) 

so that no environmental or co-operative effects in galaxy formation are permitted. If we assume such a bias between 
the galaxy and mass fluctuations, the effective bias factor b e s(z) which appears in equation (hi) can be expressed as 
a suitable average of the "monochromatic" bias b(M, z) (i.e. the bias factor of each single object): 

b cH (z)=jV( Z y 1 dhiM' b(M',z) JV{z,M') . (3) 

J M 

Here the variable M (and its range A4) does not necessarily simply represent the object's mass but rather it stands 
for any generic intrinsic properties of the object (mass, luminosity, etc.) on which the selection of the object into an 
observational sample might depend. 

Because it plays such an important role in this formalism, we have devoted the whole of Section 3 to a more 
detailed discussion of the possible form of a bias and its redshift evolution. 

2.3 Clustering Statistics 

Owing to the relatively small size of the datasets available at the present time, clustering properties of high-redshift 
galaxies are generally studied in terms of the angular [o; b s ('!?)] or of the projected real-space [w bs(»>)] correlation 
functions. Adopting the small-angle approximation, in Paper I we obtained for ui b s '- 

^obs(tf) = N~ 2 / dz G{z) Af 2 (z) bl«{z) / du f [r(«, 0, z),z], (4) 



where r(u,-&,z) = ao\/u 2 + x 2 (z)i9 2 , with x(z) given by 

(l + z') 2 (l + n 0m z') -z' (2 + z')fl A dz'j (5) 



x{z) = 


H a ^/\n\ \ 


sM 


Jo 


and 









G ^-{fz) • ^ 

In equation (B|) and hereafter, we use ilom and Q.qa to represent the contribution at the present time to a critical 
energy density from matter and vacuum energy respectively. When we require these quantities at an arbitrary time 
we use Qm and $1a; since they evolve with epoch, fi m and Ha are implicit functions of z; although the cosmological 
constant A is constant in redshift, Q,a = A/3H 2 , which varies through the Hubble constant H(z). We write the total 
density parameter f2 m + Qa = Qt', consequently f2om + £7oa = ^°«- 

Note also that, while in the Einstein-de Sitter case ao is an arbitrary length-scale which can be set to unity, in 
the non-flat case it is given by 

a = -^-\l-n m \- 1/2 . (7) 

In equation (pi), if Oot < 1, S(x) = sinh(a;) and k, = 1 — Slot; if Oot > 1, S(x) = sin(s) and k — 1 — flot', if 
Qot — 1, S(x) = x and n — 1. In the case of a vanishing cosmological constant, the previous expression can be solved 
analytically and can be written as 

, v _ ic n 0rn z + (n 0m ^2)hi + (n 0m ^ + i) 1/2 ] . . 

The projected real-space correlation function w ba can be directly obtained by £obs(f) as 



w ohs (r p ) = 2 / dy Us(Vr 2 P + y 2 ) = 2 / dr r (r 2 - r 2 p )- 1/2 ( obs (r) , (9) 

■JO Jr p 

where r v is the component of the pair separation perpendicular to the line of sight. 

2.4 Evolution of the mass autocorrelation function 

The non-linear growth of the density fluctuations modifies the shape of the power spectrum P(k) as well as its 
amplitude. In the linear regime, which holds at large scales and/or early times, the solution is given by the relation 
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4 Moscardini et al. 

P(k,z) = D 2 + {z)P{k,z = 0) , (10) 

where D + (z) is the growing mode of linear perturbations normalised to unity at z — 0, so that the spectrum grows 
with time without any change in shape. By contrast, in the strongly non-linear regime, some theoretical arguments 
and numerical simulation results suggest the existence of the so-called stable clustering regime wherein the matter 
correlations exhibit a particular form of self-similar behaviour (Peebles 1980; Jain, Mo & White 1995; Jain 1997), 
although whether the stable clustering description holds in detail is still open to some doubt (e.g. Padmanabhan et 
al. 1996; Munshi et al. 1997). We shall return to this issue later, in Section 4 of this paper. In any case, the non-linear 
behaviour of matter correlations on small scales results in a distortion of the shape of the matter power spectrum 
from its initial form. 

The clustering behaviour which is most relevant to the study of objects at high redshift is actually in between 
these two extremes. This intermediate regime is generally studied by fitting results from numerical simulations with 
a semi-empirical universal function, obtained by following the simple ansatz originally introduced by Hamilton et al. 
(1991). Recognising that gravitational collapse changes the effective length scale of a density perturbation, Hamilton 
et al. (1991) suggested that the initial (linear) scale tq of a density perturbation should be related to the final 
(non-linear) scale r of the same perturbation after collapse by: 

r = [l + ar,z)] 1/:i r, (11) 

where 

Z(r t z) = lJ y 2 Z(y,z)dy (12) 

is the integrated correlation function. The Hamilton et al. idea is that there is a universal function F acting on £, 
once the change in appropriate length scale is taken into account: 

i(r,z) = F[inn(ro,z)]. (13) 

Recently this formalism has been significantly refined and generalised. Peacock & Dodds (1994) considered the 
application of this method to the power spectra rather than the correlation function, and also to models with 
arbitrary background density (i.e. with Slot / 1 or SIoa 7^ 0). Jain, Mo & White (1995) introduced the dependence 
of the universal function F of the primordial spectral index n. Finally Peacock & Dodds (1996, hereafter PD96), 
by using N-body simulations with high spatial resolution, obtained accurate fits for F both in low-density universes 
and universes with non- vanishing cosmological constant. The accuracy of this form of the fitting function has been 
recently confirmed using numerical experiments by Jenkins et al. (1997). 

In Paper I we followed the clustering evolution by using the fitting function obtained by Jain, Mo & White 
(1995). Here, because it is our intention to consider models with flat 7^ 1 or Qoa 7^ 0, we have decided instead to use 
the form of the method presented by PD96 which deals with the (dimensionless) power spectrum A 2 : 

which is related to the two-point correlation function by 
*/ n f » 1,, s sinfcr dk 

« r > = y A <*hfcrT ; (15) 

a 2 is the variance of the density fluctuation field. In this case, the corresponding expressions for equations (113) and 
(0) are: 

A 2 {k,z)=T[Al n (k ,z)] , fc = [l + A 2 (fc,2)n 1/3 fc, (16) 

where fco and k are the linear and non-linear wavenumbers, respectively. We adopt the universal fitting function T 
given by PD96, which depends on g, a suppression factor that measures the rate of clustering growth in a particular 
cosmology relative to that which pertains in a flat matter-dominated Universe. The quantity g therefore contains a 
dependence on the background cosmology. Carroll, Press & Turner (1992) found an approximate (but almost exact) 
expression for g: 

p(n m ,fiA) = |a„[^ /7 -fiA + (l + nm/2)(l + n A /70)]" 1 ; (17) 

Since fl m and Qa both depend upon redshift z (e.g. Section 2.3), and it is the dependence of g on z in which we are 
interested, we can use g(z) to encode this behaviour for any particular cosmological model. The formula given for T 
by PD96 was originally for z = 0, but it applies at any cosmic epoch z by replacing g by g(z). The quantity x must 
also be interpreted as the linear power spectrum at the epoch z, i.e. 
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Modelling galaxy clustering at high redshift 5 

x = AJUfo, *) = A? in (*o, 2 = 0)(1 + z)- 2 [ 3 ( 2 )/ 5 (0)] 2 , (18) 

rather than at z = which is assumed in the original PD96 application. 

The form of T(x) given by PD96 assumes a power-law initial spectrum described by an index n. For models 
which are not described by pure power-law spectra, such as the variations on the cold dark matter model that we 
shall discuss in this paper, one can use the same formulae, but replacing n by an effective index n s s, defined by 

, . dlnPn n (fc, z) I 
n eS (ko,z) = -j-^ , ,,.,„■ (19) 

dlnfc \k=k (z)/2 

PD96 claim that this prescription is able to reproduce the non-linear evolution with a precision of about 7 per cent, 
which is perfectly adequate for the application we have in mind for this paper. 

Some care has to be taken if one seeks to apply the PD96 method to cosmological models where A 2 displays 
a significant peak at some particular wavenumber. Such models are not really consistent with the assumption of 
hierarchical clustering in the first place. For example, tilted cold dark matter models have n e s < —3 for large k, 
which means that they have a very sharp spectral feature and are susceptible to this difficulty (see e.g. Vittorio, 
Matarrese & Lucchin 1988). Of course, the method can still be used to construct the power spectra on large scales 
and it is possible to use safely its predictions down to the linear wavenumber where the spectrum peaks. For many 
variants of tilted cold dark matter models (as those described in the following subsection) , A 2 is sufficiently large 
there that the non-linear power will be very large. One would not want to make predictions based solely on dark 
matter clustering to any smaller scales, so the fact that n e g < —3 on the smallest linear scales is not a problem in 
the context of this work. 



2.5 The cosmological models 

One of the limitations of Paper I was that it restricted attention to a simple phenomenological model of the initial 
power spectrum and to a flat spatial geometry. It is one of the aims of this paper to extend the treatment to study 
a wider range of initial conditions and relevant changes in the global cosmological parameters (including spatial 
curvature). This is a particularly interesting task at the present time because it is well known that the so-called 
standard cold dark matter (SCDM) model — which assumes a flat universe with fio = 1 and £Ia — 0, a spectral index 
n — 1, a Hubble constant (in units of 100 km s _1 Mpc -1 ) h = 0.5 and a baryon fraction Q& = 0.0125ft~ , as predicted 
by the standard theory of the big bang nucleosynthesis — is not able to reproduce the clustering properties of the 
galaxy and cluster distribution and the cluster abundances, when normalized to the COBE data. As a consequence, 
a number of variants on this basic scenario have been suggested which might remedy its shortcomings. In this paper 
we consider different cosmological models which might be viable alternatives to the SCDM model; they all have a 
similar basic shape of power spectrum to SCDM but are engineered to have a smaller amount of small-scale power, 
which is the main problem with SCDM itself. In a general way, the initial (linear regime) power spectrum for all these 
models can be represented by 

Pi in (fc,0)=Pofc n T 2 (fc) , (20) 

where we use the fitting formula of the CDM transfer function as given by Bardeen et al. (1986): 

T(k) = ln(1 2 + 3 g 4g) [1 + 3.89, + (16.1g) 2 + (5.46g) 3 + (6.71g) 4 ] ^ . (21) 

In the previous equation q = (k/h Mpc _1 )/r. The shape parameter T is related to the matter density parameter 
flom and to the baryonic fraction flot by the relation F = Qomhexp[— Clob — \/ft/0.5 iiob/^Om)] (Sugiyama 1995). 

To fix the amplitude of the power spectrum, we either attempt to fit the present-day cluster abundance or the 
level of fluctuations observed by COBE. For the latter, we parametrise the normalisation of the 4-year COBE data 
in terms of as, the r.m.s. fluctuation amplitude inside a sphere of 8/i _1 Mpc, using the results of Bunn & White 
(1997), who used a Karhunen-Loeve expansion to produce an unbiased estimate of the normalization, with a statistical 
uncertainty reduced to 7 per cent. 

We will consider the following specific models, the main parameters of which are described in Table 1: 

• the SCDM model, as reference model, with a normalization consistent with the COBE data; 

• a different version of the SCDM model, hereafter called SCDMcl, with a reduced normalization corresponding 
to us = 0.52 which produces a cluster abundance in better agreement with the observational data (Eke, Cole & Frenk 
1996; see also Viana & Liddle 1996); 

• a tilted model (hereafter TCDM; see e.g. Lucchin & Matarrese 1985; Vittorio, Matarrese & Lucchin 1988) with 
n = 0.8 and high baryonic content (fiob = 0.1; see White et al. 1996; Gheller, Pantano & Moscardini 1998); 
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6 Moscardini et al. 

Table 1. The parameters of the cosmological models. Column 2: the present matter density parameter Qom', Column 3: the 
present cosmological constant contribution to the density Hoa; Column 4: the primordial spectral index n; Column 5: the 
Hubble parameter h; Column 6: the present baryon density Cj; Column 7: the shape parameter T; Column 8: the spectrum 
normalization erg; Column 9: the non-linear value of the r.m.s. fluctuation amplitude inside a sphere of 8h~ x Mpc Cg'. 



Model Qom Qqa n h f2o(, r <rg tjg 1 



SCDM 


1.0 


0.0 


1.0 


0.50 


0.050 


0.45 


1.22 


1.16 


SCDMcl 


1.0 


0.0 


1.0 


0.50 


0.050 


0.45 


0.52 


0.51 


TCDM 


1.0 


0.0 


0.8 


0.50 


0.100 


0.41 


0.72 


0.72 


TCDM GW 


1.0 


0.0 


0.8 


0.50 


0.100 


0.41 


0.51 


0.51 


OCDM 


0.4 


0.0 


1.0 


0.65 


0.036 


0.23 


0.64 


0.66 


ACDM 


0.4 


0.6 


1.0 


0.65 


0.036 


0.23 


1.07 


1.13 



• a different version of the previous model, hereafter TCDMgh' , with a reduced normalization of the scalar per- 
turbations (S) that takes into account the possible production of gravitational waves (tensor perturbations T) to the 
COBE fluctuations [we adopt the ratio T/S = 7(1 — n) for the ratio of tensor to scalar contribution to the quadrupole, 
as predicted by some inflationary theories (e.g. Lucchin, Matarrese & Mollerach 1992; Lidsey & Coles 1992)]; 

• a open CDM model, with Clot = Ho™ = 0.4, COBE-normalized (hereafter OCDM); 

• a low-density CDM model (fiom — 0.4), with flatness provided by the cosmological term, i.e. Qot — 1 and 
Cl A = 1 — n 0m , COBE-normalized (hereafter ACDM). 



3 MODELLING THE BIAS OF GALAXIES 

Though the number of ingredients is large in the models introduced above, the theoretical understanding of how 
clustering of matter grows via gravitational instability in the expanding Universe is quite well developed. As a 
consequence, it is relatively straightforward to compute the autocovariance function of matter fluctuations as a 
function of redshift in these scenarios. As we mentioned in Section 2.2, however, this does not lead us directly to 
a prediction of galaxy correlation properties because we still do not fully understand the details of the relationship 
between the whereabouts of the galaxies and the whereabouts of the mass. In principle, this relationship could be 
highly complicated, non-linear and environment-dependent. If this turns out to be the case then it is going to be very 
difficult indeed ever to unravel galaxy clustering observations to obtain information about the evolution of matter 
fluctuations and the cosmological parameters on which they depend. In this spirit, we are motivated to assume the 
relatively simple form of local bias represented by equation (H), though we do admit at the outset that things could 
be much more complex than this. 

Having settled on equation (|2j), our task is now to determine the behaviour of the function b(M,z) for a given 
theoretical picture. In Paper I, we introduced four different general ideas of how different classes of cosmic objects 
might be related to the mass distribution and parametrised them in terms of the simplified biasing model we adopted. 
In this paper, we shall stick to the same four basic models, but adapt them to the specific situation of galaxy clustering. 

The simplest biasing model one can imagine, and which is regarded by many as the most realistic, is described 
by b(M,z) — 1. This is called the unbiased model, though one has to be a little careful in using this terminology 
and motivation for it, particularly at high redshifts, is actually quite limited. In fact, it is not a trivial question to 
ask what is the formal definition of an unbiased population of objects, since one is attempting to relate two different 
types of mathematical field: a point set and a continuous density field. One useful definition is that the population 
of objects constitutes a random (Poisson) sampling of the matter distribution, such as is the case if galaxies are 
selected by their luminosity which, in turn, is drawn from a universal (position-independent) luminosity function. 
But as galaxy formation occurs, stellar populations and luminosities evolve and galaxies undergo merging and tidal 
disruption, it is difficult to see how b(M, z) can be equal to unity for all properties M and redshifts z. In particular, 
at sufficiently high z one reaches the point where the first galaxy to form in the observable universe produced its 
stars. It clearly makes no sense to describe this object by an unbiased model in the sense we have used it here, even 
if one does not invoke density-dependent luminosity functions or other environmental effects. Moreover, galaxies in 
general must represent some form of subset of the population of collapsed objects and, as we discuss later on, these 
objects are generally biased with respect to the underlying continuum mass distribution. So even though it is very 
simple and, at least at first sight, self-consistent, we do not believe this model to be well motivated in the context of 
this paper so we use it here as a reference for the more complex models which follow. 
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Modelling galaxy clustering at high redshift 7 

An alternative picture of biasing can be constructed by imagining that galaxy formation occurs, for a given class 
of galaxies, at a relatively well-defined redshift Zf. (One can either assume that there is a single typical formation 
redshift for a certain class of galaxies or that there is some spread in it.) If this is the case, one can further imagine 
that galaxies which are born at a given epoch Zf might well be imprinted with a particular value of b(M,Zf), in 
the spirit of equation (0), as long as the formation event is relatively local. If galaxies are biased by birth in this 
way, then they will not continue with the same biasing factor for all time, but will tend to be dragged around by 
the surrounding density fluctuations, which are perhaps populated by objects with a smaller bias parameter. In this 
case, the evolution of the bias factor can be obtained from (Dekel 1986; Dekel & Rees 1987; Nusser & Davis 1994; 
Fry 1996) 

b(z) = l + (b f -l)2±&j-, z<z f , (22) 

where 6/ is the bias at the formation redshift zj ; we have suppressed the dependence on M here for brevity. This was 
called galaxy- conserving model in Paper I. Again, it is difficult to motivate this model in detail because it is difficult to 
believe that all galaxies survive intact from their birth to the present epoch, but it at least gives a plausible indication 
of the sense in which one expects b to evolve if the timescale for galaxy formation is relatively short and the timescale 
under which merging or disruption occurs is relatively long. 

In most fashionable models of structure formation, however, the growth of large-scale features is driven by the 
hierarchical merging of sub-units. In these theories, one would not expect the survival of galaxies in their pristine 
initial state as anticipated in the galaxy-conserving model. Since the development of the clustering hierarchy is driven 
by gravity, the first things one has to understand are the properties of galactic haloes rather than the galaxies residing 
in them. One begins by calculating the bias parameter b(M,z) for haloes of mass M and 'formation redshift' Zf at 
redshift z < zj in a given cosmological model. The result is 

^'^-^^(^feF- 1 )' (23) 

where o~ 2 M is the linear mass-variance averaged over the scale M extrapolated to the present time (z — 0) and 5 C is the 
critical linear overdensity for spherical collapse [S c = const = 1.686 in the Einstein-de Sitter case, while it depends 
slightly on z for more general cosmologies (Lilje 1992)]. The above expression for the bias parameter was originally 
calculated by Mo & White (1996), although for simplicity they only gave results for z — 0. The general non-linear 
relation between the halo and the mass density contrast has been recently obtained by Catelan et al. (1998), by 
solving the continuity equation for dark matter haloes. Bagla (1997b) has discussed the clustering of haloes using 
numerical experiments. See also Ogawa, Roukema & Yamashita (1997) for a related discussion. 

At this point one has to make some assumptions on how the galaxy is connected to the hosting halo and on what 
happens when the halo merges with other haloes. This point has been discussed at some length in the literature (e.g. 
Kauffmann, Nusser & Steinmetz 1997; Roukema et al. 1997) and many issues still remain unresolved: it is one of the 
most complicated aspects of galaxy formation. In order to make progress we shall simply assume in what follows that, 
however star formation and stellar evolution proceeds in a halo once it has formed, the properties of the resulting 
galaxy are in a one-to-one relationship with the parent halo mass M. Using this assumption it now becomes clear 
that we can drop the general interpretation of M in equation (pi), as all properties of the galaxy are reducible to the 
parent halo mass (though see the comments in the last paragraph of this section) . 

Equation (pi) is not the end of the story, however, because it does not tell us anything about what happens to the 
haloes after they have formed and, in particular, says nothing about the timescale of any merging. In the standard 
treatment of hierarchical clustering - the Press-Schechter (1974) theory - all the haloes that exist at a given stage 
merge immediately to form higher mass haloes, so that in practice at each time the only haloes which exist at all 
are those which just formed at that time. If one identifies the galaxies with their hosting haloes, then automatically 
Zf — z in the previous formula, i.e. the galaxy merging rate is automatically assumed to be much faster than the 
cosmological expansion rate. This is at the basis of what in Paper I we called the merging model. Of course this 
instantaneous-merging assumption is physically unrealistic and is related to the fact that one is using a mass variable 
which is continuous, while the aggregates of matter that form are discrete. On the other hand, it does provide a 
reasonable counter to the galaxy-conserving model introduced above. 

The galaxy-conserving model (no merging) and the merging model (rapid merging) can be regarded as two 
extreme pictures of how galaxy formation might proceed. In between these two extremes, one can imagine a more 
general scenario in which galaxies neither survive forever nor merge instantaneously. The price for this greater gen- 
erality is that one requires an additional parameter to be introduced compared to equation (|22]). To understand how 
this intermediate model is constructed, it is easiest to look at how b(M, z) is used to calculate the quantity which is 
really required for observational comparisons, that is 6 e ff(z), which appears in equation (hi). Basically, one takes the 
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Table 2. The best-fit parameters of the relation for the effective bias (E4|) computed for different minimum mass and different 
cosmological models. 
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'monochromatic' (i.e. single mass) bias at each redshift (possibly with some extra parametric dependence on zj differ- 
ent from z) and then averages this monochromatic bias over the mass distribution of objects to obtain the 'effective 
bias', as described in equation (pi). The latter quantity is to be used to connect the underlying mass autocorrelation 
function with the objects two-point function, which also requires convolution with the redshift distribution M(z)\ see 
equation hi). 

As in Paper I, we can estimate the effective bias by assuming that the objects observed in a given survey represent 
all haloes exceeding a certain cutoff mass M m i n at any particular redshift. In other words, we assume that there is 
a selection function 4>{z,M) = Q(M — M m i n ) at any z, where &(•) is the Heaviside step function. This is consistent 
with the reasoning we mentioned above, that all galaxy properties are reducible to the parent halo mass M. In this 
way, by modelling the linear bias at redshift z for haloes of mass M as in equation ( j23| ) and by weighting it with the 
theoretical mass-function n(z, M) which we can self-consistently calculate using the Press-Schechter theory, we can 
obtain the behaviour of b c g(z) directly. The results for different cosmological models are shown by the solid lines in 
Fig. 1, where various choices of the minimum cutoff mass in n(z, M) are shown for reference. 

The behaviour of b B « (z) can be fitted by a relation of the form 

b e «(z) = 1 - l/Sc + [M0) - 1 + l/S c ]/D+(zf . (24) 

The resulting best-fit parameters b e g (0) and j3 are reported in Table for different choices of initial matter fluctuation 
spectrum and minimum mass. In all cases, the effective bias is a increasing function of both redshift and M m i n . Notice 
further that a relatively strong anti-bias b Q g < 1 can be produced at Z — if the minimum mass is small, because all 
the small haloes still existing at a late stage of the clustering hierarchy will tend not to lie in dense regions. 

So far this argument works equally well for the calculation of b e g(z) in the framework of the merging model. But 
notice that the parameters of that model are entirely fixed because one has to match the evolution of the clustering 
hierarchy to observations at the present epoch. This amounts to matching the present-day value of b(z) by relating 
clustering properties of bright galaxies (e.g. the measured value of as for these galaxies) to the analogous quantity 
predicted in a given model for the matter distribution. In terms of the argument given in the preceding paragraph, 
this basically means that M m i n , the free parameter, is fixed by requiring the present population of galaxies to have 
been entirely produced by a merger-driven hierarchy. On the other hand, we might decide that galaxies one might 
happen to see at large redshifts cannot be identified with the ancestors of present-day galaxies. In this case one can 
regard Mmin as a free parameter, but once a minimum mass is chosen, the value of b c g implied at redshift z — will 
probably not correspond to the value of the bias parameter measured for any known class of objects. One must then 
assume that such objects are missing in local surveys, either because they had undergone a transient increase in their 
luminosity and have now faded or because they correspond to extended objects of low surface brightness, visible at 
high redshift, but invisible at small distances, due to selection effects. In Paper I we called this model the transient 
model and we will adopt this nomenclature here. 

The transient model is more strongly motivated from a theoretical point of view for QSOs rather than galaxies. 
Efstathiou & Rees (1988) have used efficiency arguments to obtain a relatively high minimum halo masses for QSOs, 
which may be as large as ~ 10 12 Mq. On the other hand, Haiman & Loeb (1997) have suggested that short-lived 
quasar activity may be possible in haloes of much lower mass than this. The transient model for quasars appears 
to be consistent with observations of the clustering evolution of these objects (e.g. La Franca, Andreani & Cristiani 
1997), but the available QSO data do not rule out alternative models based on the clustering of collapsed objects 
which can behave in a qualitatively similar way to the transient model (Brainerd & Villumsen 1994; Ogawa et al. 
1997; Bagla 1997b). In the galactic setting the choice of halo mass is much less obvious than in the case of QSOs, 
and the haloes that host typical galaxies may well be much smaller than those that host QSOs. 
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Figure 1. The effective bias b e g as a function of the redshift z for the cosmological models considered in this paper. The 
different solid lines refer to different values of the minimum mass M m ; n in the Press— Schechter mass-function, ranging from 
10 9 to 10 14 h~ 1 Mq, from bottom to top. The dotted lines show the effects of the catalogue selection: the results are obtained 
by assuming a bolometric magnitude limit m,, = 24, a mass-to-luminosity ratio M/L = IOMq/Lq and a K+E— correction 
expressed by the relation Oog(l + z), with K = —1, 0, +1 (from bottom to top). 



We should make the point that these simple schemes do not exhaust all the possible scenarios through which 
galaxies might have formed and evolved. For example, it is quite possible that merging could play a different role at 
different redshifts. Present day bright disk galaxies, for example, have clearly not just formed at the present epoch 
since their properties suggest a lack of mergers in the recent past. On the other hand, it is plausible that galaxies at 
much higher redshifts, say z ~ 2, are undergoing merging on the same timescale as the parent haloes. This suggests 
the possible applicability of a model where rapid merging works at high redshift, but it ceases to dominate at lower 
redshifts and the bias then evolves by equation (|22j) until now. In this context it is interesting to note that, while bf 
is a free parameter in equation (123), it is actually predicted by equation (124), once the appropriate minimum mass is 
specified. Thus matching the merging phase (E4j) onto the conserving phase (B2j) gives 
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b f = l + (bo- l)D+\z f ) = 1 - i + («a(0) - 1 + ~) + (^/)-' 3 (25) 

for the bias these objects would have at Zf when they stopped merging. In this equation 6* fl -(0) is to be interpreted 
as the effective bias the galaxies would have now if they continued merging from Zf until now; since the galaxies do 
not do this, the actual present-day bias will be different. Evolving from z = Zf until z = using equation (122]) yields 

60 = : - ^^ + { Ks{0) - 1 + i) D +~ P(zf) ■ (26) 

It is interesting to speculate whether the clustering of present-day 'bright' galaxies can be explained in this picture. 
For example, in an Einstein-de Sitter model, such galaxies must have 60 — 2 if the constraint on og from cluster 
abundances is correct (Eke, Cole & Frenk 1996; Viana & Liddle 1996), since ag measured for these objects is of order 
of unit. In order to obtain bo — 2, we need to have the last term in equation ( |26[ ) to be of order unity. Unless Zf 
is very large, this means a relatively large value of b* s (0) which, from Table 2, requires a relatively large minimum 
mass of order 1O 12 /i -1 M0 for those models with ag ~ 0.5. Such a picture would therefore explain a large present- 
day value of the bias of galaxies with very large haloes, if these galaxies could be identified with objects that for 
some reason had not undergone significant merging in the recent past. Whether this interpretation is consistent with 
observations of the dependence of galaxy merger rates with redshift (e.g. Ellis 1997; Neufschaefer et al. 1997; Roche, 
Eales & Hippelein 1997) is a subject for further study. Notice that in open models and models with non-vanishing 
cosmological constant the value of as coming from the constraint on the cluster abundance is larger (see e.g. Viana & 
Liddle 1996). For example, for Qom — 0.4, as — 0.8 is required. Consequently the present-day 'bright' galaxies must 
have bo — 1.3 and the corresponding minimum mass can be smaller. We shall not investigate this model any further 
in this paper, however, as it contains nothing that makes it qualitatively different from the previous examples. 

We can now summarize these arguments by introducing a simple unified model for bias which incorporates all 
four of these previous examples as special cases. As noticed in Paper I, all these models can be described by the 
equation 

b cB (z) = b-i + (b - b-^/D+iz) , (27) 

with suitable parameters 60, &-i an d 13; 6_i may be interpreted as the bias factor at the end of the era of cosmological 
expansion, i.e. at the maximum expansion in a closed model, or as t — » 00 in an open or flat model. The particular 
examples we consider are: 

• the unbiased model, with b(z) = 1; 

• the transient model, where the parameters are fixed by the choice of the minimum mass (see Table H); we will 
use M min = 10 11 fc _1 .M©; 

• the merging model, where the parameters are fixed by the value of the bias parameter at z — (60 = l/°"s) and 
taking the bias relation for the corresponding minimum mass; 

• the galaxy-conserving model, which has 6_i = /3 — 1 and bo = 1/os. 

In order to be fully consistent with our formalism, we adopt the non-linear value of the r.m.s. fluctuation 
amplitude inside a sphere of 8/i _1 Mpc, <7g , computed by using the PD96 method. The values of a^ 1 for the different 
cosmological models are reported in the last column of Table 111 

In Fig. 2, we display the actual evolution of b e g (z) for each of the cosmologies and for each of these four biasing 
models. Notice that there is a considerable variation in the behaviour expected depending on the cosmology under 
consideration, except (of course) for the unbiased model. We shall return to this in the next Section. 

As final point, we should mention how the bias factor changes when catalogue selection effects are considered, 
i.e. when theoretical quantities, as the mass M, are substituted with the observational ones, such as the luminosity 
L. After choosing one of the models above, one will end up with the quantity b(M, z) to be understood as 'the bias 
that objects of mass M have at redshift 2'. The effective bias at the same redshift is precisely 

b aB (z) = N(z)- 1 f dlnLt> ohs (L)b[M(L),z] , (28) 

where N(z) — J d\nL§ i, B (L) and & b B (L) is the observed luminosity function of the catalogue, i.e. the intrinsic 
luminosity function multiplied by the catalogue selection function, which will typically involve a cut in apparent 
magnitude in whatever wave-band is being used, rather than in the somewhat idealised case we discussed above 
where everything corresponds to a cut in mass M. Because of this cut (for magnitude or flux-limited catalogues), 
in order to obtain 6 e ff one should need to know the distance modulus of the galaxies at a given z, including all 
possible K-corrections and evolutionary (E) effects in order to calculate this exactly. Furthermore, because the bias 
is typically expressed as a function of mass, one needs to know the mass-to-light ratio in the given wave-band. To test 
the size of this effect we computed an illustrative example for all the cosmological models described above, assuming 
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Figure 2. The fits to the function ft e j{(z) for each cosmology discussed in this paper and for the four different biasing models 
described in the text: unbiased model (solid line); galaxy-conserving model (dotted line); merging model (short-dashed line); 
transient model (long-dashed line). 

a bolometric magnitude limit of m lim = 24, a mass-to-luminosity ratio M/L — 10 Mq/Lq and a K+E-correction 
parametrised by the relation AC log (1 + z). The results for K. — — 1,0, +1 are shown as dotted lines in Fig. 1. The 
general effect of this 'selection bias' is to exaggerate the increase of the bias factor with redshift even further. This is 
particularly evident when small minimum masses M m i n are considered, though it has little effect on the quantitative 
results we have obtained in the next section, and none at all on their qualitative interpretation. We shall not therefore 
discuss this detail any further in our analysis. 



4 SIMPLE MODELS OF CLUSTERING EVOLUTION 

As we mentioned in the Introduction, theoretical interpretations of information on clustering evolution have frequently 
been rather naive. In particular, many observational results are quoted in terms of the parameter e in the following 
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simple scaling model for the redshift evolution of the two-point correlation function £(r, z) at the comoving separation 
r: 

£(r, Z )=£(r/(l + ^),0)(l + z)- (3+E) . (29) 

If the spatial dependence of the two-point function can be fitted by a power-law with slope 7, the above relation 
further simplifies to 

£(r,z) = (r/r c )^(l + zr (3 -^\ (30) 

where r c is a constant measuring the unit crossing of £ at z — 0. 

Recent observational studies (e.g. Le Fevre et al. 1996; Shepherd et al. 1997; Carlberg et al. 1997; Roche, Eales 
& Hippelein 1997; Woods & Fahlman 1997) have served to highlight the importance of understanding the validity 
(or otherwise) of the simple models fej) & (BOh. It is still commonplace for observational data to be framed in terms 
of e as if this parameter had some unambiguous theoretical significance. For example, Le Fevre et al. (1996) and 
Shepherd et al. (1997) have reported a value of e ~ 1 ± 1 from an analysis of galaxy clustering at moderate redshifts. 
But what does this imply for theoretical models? 

Assuming that clustering grows by gravitational instability alone, the above formulae can be interpreted in a 
few special cases. For e = it reproduces the prediction of the so-called stable clustering model (cf. Peebles 1980) , 
while for e = n + 2 = 7~ 1, it results from the application of linear theory in an Einstein-de Sitter universe to purely 
scale-free power spectra with Pn n (k,0) oc k n . The case where e — 7 — 3 corresponds to a clustering pattern that 
simply expands with the background cosmology as if the galaxies were just painted on a homogeneous background. 

Concerning the case of stable clustering (e = 0), one should remember that the idea underlying the stable 
clustering ansatz is that, on sufficiently small scales, gravity acts to stabilize the number of neighbours of an object 
in a proper volume, after this has turned around from the universal expansion. Numerical simulations, however, 
suggest that this type of dynamical regime is only entered, if at all, when the mass autocorrelation function is at 
least as large as ~ 100 (e.g. Efstathiou et al. 1988; Bagla & Padmanabhan 1996; Padmanabhan 1996; Munshi & 
Padmanabhan 1997; Jain 1997; Munshi et al. 1997), which only occurs on scales much smaller than the dimensions 
of typical surveys. Melott (1992) considered the growth of clustering in numerical simulations for an ensemble of 
scale-free models. He found that the lower the value of the spectral index n, the larger is the value of the parameter 
a = 3 — 7 + e and that positive values of e are easily allowed for in all models with n < 1. Melott's explanation 
for such a fast clustering growth is as follows: stable clustering is not an upper limit to the growth of correlations; 
whenever the initial conditions contain non-vanishing large-scale power, merging makes new clusters form and their 
central density increases with time, which in turn enhances the growth of correlations. Moreover, a numerical study 
of the evolution of the two-point function both for the matter and halo population has been carried out by Colin, 
Carlberg & Couchman (1997); they obtain a scale-dependent e parameter which is about 1 for mass particles in an 
Einstein-de Sitter universe, and lower for low-density models. A broad range of values (ranging from —0.2 to 1 in 
the flat case and reaching lower values in the open case) is obtained for haloes, depending on their mean density (see 
also Brainerd & Villumsen 1994). Jain (1997) has discussed the reliability of the general relation of equation fe)|) in 
the context of various models. His conclusions are that the above parametrisation for the evolution of clustering is 
inaccurate in CDM-like models, for two reasons. First, because the growth of £(r, z) with time on intermediate scales 
is much faster than the (1 + z)~' A law prescribed by stable clustering at fixed proper separation (see also PD96) and, 
second, because the boundary between the linear, mildly non-linear and stable clustering regimes, occurs at scales 
which rapidly change with time. 

As one can therefore see, the theoretical interpretation of e is open to some doubt. This doubt widens when 
one considers the different ways e could be defined when the correlation function is not of power-law type and the 
dynamical evolution is not of the self-similar form, situations which are actually expected in realistic models. In this 
case one could define e to be the value at a particular proper or comoving distance r where the slope 7 can be defined, 
perhaps the scale corresponding to ro. Alternatively, one could fit a power- law to all the data and use this to define 7 
and get e that way. In general these values of e will not be equal. Likewise, e at a given scale r need not be constant 
with time (or redshift). 

To illustrate the problem we have calculated, in Fig. 3, the behaviour of e (defined at two fixed proper separations 
corresponding to 1ft -1 Mpc and to the value of ro at the present epoch) for three of our models (SCDMcl, OCDM, 
ACDM); cf. Mo (1997). The value of e at z is obtained by fitting the correlation function £ in the redshift interval 
[0, z\. The results for the mass (solid line) are relatively constant with redshift. Notice that for this case the value of 
e is higher than the stable clustering value, probably due to the effects of merging. But in any case the scale probed 
here is much larger than the scale at which one expects stable clustering to apply in a flat Universe. In the open 
case, the matter distribution actually matches e = quite closely, which is expected because bound objects suffer no 
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Figure 3. The behaviour of the scaling parameter e measured at a fixed physical length scale (equal to l/i -1 Mpc on the left 
and equal to the present correlation length ro on the right) for models with three different background cosmologies. Different 
biasing models arc as Fig. 2. 

future disturbance in open models when free expansion takes over (cf. Padmanabhan et al. 1996). The ACDM model 
is intermediate between these two, with clustering only freezing out much later as the expansion begins to accelerate. 

The situation for the galaxies, however, is much more confused than the case for the matter. Different biasing 
models yield very different predicted behaviours for e(z), which suggests that the usefulness of this parametrisation 
of clustering is most limited for precisely those objects which one could actually observe. Notice also that the value 
of e depends on the scale at which it is measured, adding further confusion to its interpretation. 

As well as e, which measures the rate of evolution, one is also interested in what the characteristic length scale 
of clustering might be as a function of redshift. One way to encode this information is via the quantity ro(z), the 
distance at which the correlation function has unit amplitude. This quantity represents a kind of characteristic scale 
of the clustering pattern, so one might try to compare the sizes of individual structures with this quantity. 

In hierarchical clustering models, the generic expectation is that this (comoving) characteristic scale must decrease 
with increasing redshift. Fig. 4 demonstrates that, while this is certainly true for the distribution of mass, it need 
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Figure 4. The behaviour of the (comoving) correlation length ro (in units of h~ 1 Mpc) as a function of redshift for the various 
models considered. Different biasing models are as in Fig. 2. Note that, although the matter correlation length always decreases 
in comoving coordinates, the correlation function of galaxies need not do so if there is significant and evolving bias. 

not be true for galaxies selected with particular forms of bias. An ro that increases with redshift is obtained in both 
merging and transient models in most cases. 

The fundamental point that arises from these considerations concerns the approach one adopts to test theories. In 
the present situation, one is attempting to eliminate some particular well-defined models from a shortlist of contenders. 
In other words, our aim is hypothesis testing. This kind of test is best performed in the 'observational plane', i.e. by 
computing exactly what an observer would see in a given model universe and comparing it with what is seen in ours. 
It is not useful in our view to treat the problem as one of inference, where one tries to fit a model parameter (in this 
case e) to the observations, particularly a parameter which is of such limited usefulness and theoretical significance. 



5 APPLICATIONS AND RESULTS 
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5.1 The surveys 



In the following subsections we will apply our formalism to the correlation analyses (both angular and projected) of 
three different datasets, recently constructed for the study of the distribution of high-redshift galaxies. 

The Canada-France Redshift Survey (CFRS; Le Fevre et al. 1996 and references therein) consists of 591 galaxies 
with certain spectroscopic redshifts and magnitudes in the range 15.5 < Jab < 22.5. The sample covers 71 square 
arcminutes. The redshift distribution (Crampton et al. 1995) extends up to z ~ 1.6 with more than 60 per cent of 
galaxies with redshift larger then z = 0.5. 

The Hawaii Keck K-band sample used in the study of Carlberg et al. (1997) is an almost complete sample up to 
K — 20, I = 23 and B — 24.5 magnitudes covering an area of about 27 square arcminutes. The redshift distribution, 
presented in Carlberg et al. (1997), is slightly changed with respect to an earlier version of that paper, used in Paper 
I. Now it contains 248 galaxies, the 80 per cent of them with redshifts between z — 0.28 and z — 1.39. 

The Hubble Deep Field (HDF) is a program of very deep observations made from the Hubble Space Telescope 
in four passbands and covering a field at high galactic latitude. Different galaxy catalogues have been extracted 
from the HDF (Williams et al. 1996; Clements & Couch 1996). In their analysis, Villumsen, Freudling & da Costa 
(1997) use a total of 1732 galaxies detected in the F606W filter which has characteristics similar to an R passband. 
Recently estimates of the redshift distribution based on photometric redshifts have been obtained by different authors 
(Mobasher et al. 1996; Sawicki, Lin & Yee 1997; Connolly et al. 1997). In order to compare our predictions with the 
Villumsen, Freudling & da Costa (1997) results, we are forced to use the same redshift distribution adopted in that 
paper, i.e. 

2 

M{z) = 2.723— exp[-(z/z ) 2 - 5 ] , (31) 

z o 

where zo is the median redshift. 

The last dataset here considered is the survey for z ~ 3 galaxies recently started by Steidel et al. (1996, 1998). 

Their observations use the U n , G and R photometric system that is sensitive to the Lyman break in high-redshift 

objects. After spectroscopic confirmation, they found 67 objects with redshift z > 2 in one of their fields (SSA22a+b), 

covering an angular area of 8.74' x 17.64'. 

5.2 The CFRS angular correlation function 

In Fig. 5 we show the model predictions for the angular correlations of the CFRS catalogue limited to z < 1.6. In the 
same plot we also show the observational results obtained by Hudon & Lilly (1996) using two different methods which 
likely bracket the true values: local and global determinations are presented by open and filled squares, respectively. 
Notice that for SCDM, only the merging model is compatible with the data, indicating that one cannot reconcile 
the objects in this survey with observed galaxies at low redshift. All other biasing models fail to reproduce the 
data within the SCDM framework: the predicted evolution of clustering is too strong in this scenario. In the other 
cosmological models, the transient model is always compatible with the data, and for some of them (SCDMcl, 
TCDM, TCDMgiv and OCDM) also the unbiased model can roughly reproduce the data. The data are incompatible 
with both the merging and galaxy-conserving models for any cosmology (cf. Roukema & Yoshii 1993). 

5.3 Keck K-band angular correlation function 

Carlberg et al. (1997) computed the angular correlation function for the Keck dataset limited to z < 1.6. The 
comparison between these results and our various models is shown in Fig. 6. 

The rather large errors on the observational correlations mean that discriminatory power is less than in the 
previous case. Basically, all biasing schemes are compatible with the data in any of the models, although the merging 
model predictions are uncomfortably high for both versions of SCDM and TCDM (cf. Roukema & Yoshii 1993). 

5.4 Hubble Deep Field Angular Correlations 

Villumsen, Freudling & da Costa (1997) computed the two-point angular correlation function for the HDF survey 
using eight different magnitude limits, ranging from R — 26 to R = 29.5. Here we prefer to compare the predictions 
of our different models to the observational results only for the catalogue with limit R = 29, corresponding to a 
median redshift zo = 1.87 in equation (|3lj). With this choice the observational results have the smaller errorbars and 
are expected to be more discriminant. In fact, the results, shown in Fig. 7, impose impressively strong constraints on 
the combination of biasing scheme and cosmological model. All combinations are excluded for SCDM. The merging 
model is always excluded in any cosmology (cf. Roukema & Yoshii 1993). The galaxy-conserving model can fit the 

© 0000 RAS, MNRAS 000, 000-000 



16 Moscardini et al. 



CFRS 



~i — I — i — i — i — i — I — i — i — i — r 



i — i — I — i — i — i — i — r 

SCDM 



i — i — I — i — i — i — i — I — i — i — i — i — I — i — i — i — i — I — i — i — i — i — r 

SCDM CL 



o 



o 



o 



J3 

o 



o 



-- - - _ 



1 - 




-3 


-1 



-3 



TCDM 



OCDM 



TCDM 



GW 




ACDM 




0.5 1 1.5 2 2.5 0.5 1 1.5 2 2.5 

log 6 (arcsec) log 6 (arcsec) 



Figure 5. Theoretical prediction in different cosmological models for the angular galaxy correlation function from the Canada- 
France Rcdshift Survey. The galaxies have z < 1.6 and M(z) is taken from Crampton et al. (1995). Correlation data are from 
Hudon & Lilly (1996) and are obtained by using two different methods which bracket the true values: the local and global 
determinations are shown by open circles and filled squares, respectively. Different bias models arc considered, as in Fig. 2. 



data only within the ACDM model. The models which appear to be in best agreement with the data are the unbiased 
and transient bias schemes in low density models (either with or without a A-term). 



5.5 CFRS Projected Correlation Function 

The CFRS data have been analysed also in terms of the projected correlation function by Le Fevre et al. (1996). They 
divided the galaxies in three different strips in redshift with median redshift z ~ 0.34, z ~ 0.62 and z « 0.86. We 
use these median redshifts to rescale in comoving coordinates the projected separations, originally plotted in proper 
coordinates. Their correlations have been computed by using go = 0.5; consequently the results have to be translated 
for different models because both w(r p ) and the distance r depend on cosmology. For this goal we follow Peacock 
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Figure 6. Theoretical prediction in different cosmological models for the angular galaxy correlation function from the Hawaii 
K-band Survey. The galaxies arc in the redshift range < z < 1.6, with redshift distribution and correlation data taken from 
Carlberg et al. (1997). The original data arc corrected to take into account the dilution produced by the uncorrclatcd foreground 
stars. Different bias models are shown as in Fig. 2. 

(1997), particularly his discussion of the same observational dataset in Section 4.1 of that paper and specifically using 
his equation (40). 

The results, presented in Fig. 8, show that the transient model is compatible with the data for all cosmologies, 
though this is marginal in the case of SCDM. The unbiased model is compatible with the data for SCDMcl, TCDM 
and OCDM. On the other hand, the merging and galaxy-conserving models are always inconsistent. 

5.6 Keck K-band Projected Correlation Function 

The projected correlation function has been computed also for the Hawaii Keck K-band survey by Carlberg et al. 
(1997). They present the results for four different redshift strips, with median redshift z ~ 0.34, z ~ 0.62, z « 0.97 
and z ~ 1.39, by adopting go = 0.1. As before, we rescale in comoving coordinates by using the median redshifts 
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Figure 7. Theoretical prediction in different cosmological models for the angular galaxy correlation function for the Hubble 
Deep Field. The results are for the sample with magnitude limit R = 29 and median redshift zrj = 1.87. The redshift distribution 
is given by equation (31). The shaded region in the plots refers to the la range allowed by the fit obtained on the observational 
data by Villumscn, Frcudling & da Costa (1997). Different bias models are shown as in Fig. 2. 

and we translate the observational results for different cosmological models following Peacock (1997). Notice that the 
observational data used here are different with respect to those used in Paper I presented in an earlier version of the 
Carlberg et al. paper. 

Interpretation of the results, reported in Fig. 9, is slightly complicated by the strange shape of the measured 
correlation function at low z. One could resort to a scale-dependent bias to solve this difficulty (see below), but in 
any case this makes it difficult to exclude models on the basis of the results for the low redshift bin. It is worthwhile, 
however, considering what one might conclude if some of these results were subject to an unknown error. If one 
accepts the points at small separations as being 'accurate', then they favour the number-conserving and merging 
models in all the cosmologies, and also are consistent with an unbiased model for ACDM and SCDM. If instead one 
discards these points and concentrates on the intermediate separation points, they favour the transient and unbiased 
models for SCDM, TCDM and OCDM. 
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Figure 8. Theoretical prediction in different cosmological models for the projected galaxy correlation function of the Canada- 
France Redshift Survey sample as a function of the (comoving) separation r p (in units of h^ 1 Mpc). The redshift distribution 
is given by Crampton et al. (1995). Correlation data are from Le Fevre et al. (1996). Different rows refer to different strips in 
redshift: 0.2 < z < 0.5 (top), 0.5 < z < 0.75 (centre) and 0.75 < z < 1 (bottom). Different bias models are shown as in Fig. 2. 

The data at larger redshifts have much larger errors, but it emerges robustly that the merging model is excluded 
by the data for any cosmology. Generally speaking the unbiased and transient model are reasonable fits in all cases 
considered, though for SCDM the unbiased case is only marginally acceptable. For consistency, these results at higher 
z lead one to prefer the interpretation that the putative problem with the low-redshift data does indeed affect the 
small-separation points, rather than those at larger scale but this argument is, of course, not rigorous. 

5.7 Lyman-Break Galaxies 

Steidel et al. (1996, 1998) have reported evidence for the existence of a strong concentration of galaxies at z ~ 3 in 
their angular field. This 'spike' contains 15 objects (plus one faint QSO) in a redshift bin of width Az = 0.04. Various 
authors have discussed the probability of such an object arising in particular cosmological scenarios (Mo & Fukugita 



© 0000 RAS, MNRAS 000, 000-000 



20 Moscardini et al. 



Hawaii Keck K-band 



iiii|iiii|Mii 
: SCDM : 



0.2<z<0.4 



1 1 1 1 1 1 1 1 1 1 1 1 1 

SCDM CL : 

0.2<z<0.4 



iiii|iiiiiiiii 

: TCDM : 

0.2<z<0.4" 



iiii|iiiiiiii 

; TCDM Gw ; 

0.2<z<0.4 



iiii|iiiiiiii 

: OCDM 



0.2<z<0.4 



i ii ii in 

ACDM : 

T 0.2<z<0.4" 



o 



00 




0.4<z<0.8 



0.4<z<0.8 



0.4<z<0.8 




0.4<z<0.8 




0.4<z<0.8 



0.4<z<0.8 



-2 -1 0-2-10 -2 -1 0-2-10 -2 -10-2-10 1 



log r 



log r log r log r log r log r 

Op l3 P P P P 



Figure 9. a. Theoretical prediction in different cosmological models for the projected galaxy correlation function of the 
Hawaii Keck K-band survey as a function of the (comoving) separation r p (in units of h^ 1 Mpc). The redshift distribution 
and correlation data are from Carlberg et al. (1997). The 1<t bootstrap and the poissonian errorbars are shown by narrow and 
wide error flags, respectively. Different rows refer to different strips in redshift: 0.2 < z < 0.4 (top) and 0.4 < z < 0.8 (bottom). 
Different bias models are shown as in Fig. 2. 

1996; Baugh et al. 1997; Steidel et al. 1998; Jing & Suto 1998; Governato et al. 1998; Bagla 1997a; Wechsler et al. 
1997; Peacock et al. 1998), reaching somewhat equivocal conclusions. 

Although not designed for this particular problem, which can only be resolved in an entirely satisfactory fashion 
using N-body simulations, the formalism we have constructed in this paper can be used to shed qualitative light 
on the concentration of Lyman-break galaxies in a very simple way. In principle, the correct theoretical tool to this 
purpose would be the formula for the probability of finding N objects in a given volume, which for each model depends 
on both the mean number of objects and on all the hierarchy of correlation functions, suitably smoothed over the 
volume (White 1979). However, no sound theoretical predictions exist for the evolution of the entire hierarchy of 
correlation functions into the non-linear regime. We can, however, get a useful insight by calculating the expected 
number of neighbours Nr within a distance R, given the presence of an object at the origin. This is larger than the 
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Figure 9. b. As Fig. 9a but for different strips in redshift: 0.8 < z < 1.2 (top) and 1.2 < z < 1.6 (bottom). 



mean number of galaxies an a randomly-selected volume by factor of [1 + £(-R)]; this factor therefore measures the 
average 'excess' number of galaxies that tend to accompany a given galaxy. At redshift z, the quantity Nr is related 
to the integrated mass correlation function £ by Nr — N[l + b e g(z)^(R, z)], where N is the mean number of objects 
in a sphere of radius R. In order to calculate this we need to know three different quantities: the value of N; the 
radius R corresponding to the volume of the considered bin; the appropriate model of bias. 

The value of the mean number of objects can be taken directly from the smoothed redshift selection function, 
obtained by Steidel et al. (1998) from the whole survey. From their Fig. 1, it is possible to infer that N ~ 4.5 at 
z ~ 3. As for the radius R, the volume of the bin depends on the cosmology because of the dependence of proper 
distances on angles and redshift intervals. The translation of the angular size of the field and the width of the redshift 
bin into volumes therefore depends upon the parameters Q. m and Qa- The bin is roughly equivalent to a sphere with 
R — 7.5ft -1 Mpc in a universe with Qom = 1 and a factor ~ 1.5 larger in the other cosmologies here considered. 
It is also possible that redshift-space effects might play a role in the interpretation of these results. In particular, it 
seems quite likely (on the basis of its high redshift) that the concentration of Lyman-break galaxies is still collapsing. 
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Table 3. The predictions of the expected number of Lyman-break galaxies inside a sphere of radius R = 7.5, -R = 10 and 
R = 12.5 h Mpc (N7.5, Nio and N12.5 respectively) for different cosmological models. Two different values for the minimum 
cutoff mass (M m i n = 10 11 and lO 12 h _1 M0) are used. The values of the effective bias b c ff at redshift 2 = 3 and the comoving 
correlation length rg (in units of h~ 1 Mpc) are also reported. 
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Collapse in the observer's line-of-sight would tend to enhance the concentration observed in redshift space relative to 
the real space concentration and meaning that the real space length scale of the structure should be larger than that 
perceived in redshift measurements. This argues for a larger value of R than the previous values. However, Steidel 
et al. (1998) and Bagia (1997a) have shown that these effects are not particularly important on the scale of the bins 
chosen: the former authors put an upper limit of about 10 per cent on the redshift distortion factor. In order to allow 
fully for the redshift distortions and possible background geometries, in the following analysis we will consider, for all 
the models, three different values of R: R = 7.5, R — 10 and R = 12.5/i _1 Mpc. We will call the corresponding results 
Nr.5, Nio and N12.5, respectively. The smaller values are probably more realistic for Einstein-de Sitter universes, 
whereas the larger pair brackets the range for open and A-dominated models. 

The last problem is the choice of the bias model. As already discussed in Section 3, the process of merging is 
expected to dominate at high redshift, when structures are still forming hierarchically. In order to mimic the behaviour 
of the Lyman-break galaxies we can therefore reasonably assume the behaviour of b e g shown in Fig. 1 (with the fits 
reported in Table g). The results for two different values for the minimum cutoff mass (Af m i n = 10 11 and 10 12 /i _1 Mq) 
are reported in Table j3|. 

It is clear from Table 3 that, for all choices of the parameter M m i n and for all allowed values of R, the expected 
number is always smaller than the observed one. However, the mean number of objects in a randomly-selected bin at 
this redshift would be around 4.5. If the presence of one galaxy in the bin is imposed then this number rises to the 
number given in the table. If a typical fluctuation can raise the number from 4.5 to around 10, as it can for models 
with the higher minimum mass, then a number around 15 is certainly not an inconceivably large fluctuation. We 
would expect fluctuations about the mean excess to be at least of the same order as the mean itself. Only SCDM (and, 
more marginally, ACDM) seems to have serious problems getting close to the value required, mainly due to the low 
value of the bias parameter. Obviously, the predicted numbers decrease when larger radii are considered, increasing 
the gap between model predictions and observations. Consequently, the effect on the final results of including redshift 
distortions can be very strong. 

Another comment can be made on the minimum mass. In order to have better agreement with the Steidel et al. 
(1998) result, Af m i n has to be of order 10 12 /i _1 Mq or more. This seems to indicate that the Lyman-break galaxies 
should be interpretated as progenitors of massive galaxies at the present epoch (e.g. Steidel et al. 1998) or precursor 
of present day cluster galaxies (e.g. Governato et al. 1998) . 

Consistency of these Lyman-break galaxy data with the model predictions also requires that the mean number of 
objects estimated by Steidel et al. (1998) is generally smaller (because of selection effects) than the mean number of 
these objects predicted by the theory. This can be estimated in our formalism by using the Press-Schechter formula 
to get the number of haloes more massive than Af m ; n at redshift z ~ 3. We checked that all our models satisfy this 
constraint (see also Jing & Suto 1998), but predicting precisely which haloes give rise to a Lyman-break galaxy is 
beyond the scope of our theory. 

Our formalism also allows to predict the spatial correlation function £ b a of the Lyman-break galaxies. To this 
aim we use equation (111), where the redshift distribution is taken from Steidel et al. (1998) and the bias models is 
chosen as before. The results obtained for a minimum mass of 10 12 /i _1 M@ (that we found to be in better agreement 
with the observation of the concentration of 15 galaxies at z ~ 3) are shown for the different cosmological models in 
Fig. 10. From their N-body simulations, Governato et al. (1998) found that the effect of redshift distortions is strong 
at (comoving) scales smaller than ~ l/i -1 Mpc (see also Wechsler et al. 1997). For this reason we prefer to plot our 
results only for larger scales. We find that the predictions for the various models are quite different, in agreement 
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Figure 10. Theoretical prediction in different cosmological models for the observed spatial correlation function of the Lyman- 
break galaxies as a function of the (comoving) separation r (in units of /i -1 Mpc). The redshift distribution is taken from 
Steidel et al. (1998). A minimum mass of 10 12 h~ 1 Mq is used to compute the effective bias. 

with the analysis of Wechsler et al. (1997). The correlation length ro (reported in Table m ranges from 2.7/i _1 Mpc 
for SCDM to 7.3/i _1 Mpc for TGDMgw ■ These differences, mainly due to the large spread in the value of the bias 
parameter b e « at high redshifts, seem to indicate that the measurement of the correlation function of these objects 
(when reliably available) can be used to constrain the cosmological models. 



5.8 General Comments and Caveats 

The problem posed by several of these data sets concerns the slope of the correlation function rather than its amplitude. 
One should not at this stage, however, infer very negative conclusions about cosmological structure formation scenarios 
on the basis of the shape. As we mentioned above, we have assumed that the bias is modelled by a constant bias 
factor. As was shown by Coles (1993), this is not the generic expectation even in local bias models and, indeed, one 
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expects to see a steepening of the correlation function on small scales resulting from the introduction of non-linear 
terms into a generic biasing relation of the form 

<S n (x; M, z) ~ / M , z [5m(x, z)) . (32) 

The more non-linear the function /, the steeper one expects the galaxy correlation function to be compared with 
the matter correlations. One also expects this phenomenon to be more prominent when the linear bias factor (which 
can be thought of as the first term in a series approximation to /) is large, i.e. at high z. However, the situation in 
realistic scenarios is not as clear cut as this. In biasing models based on the properties of dark matter haloes, we are 
not dealing with a generic Taylor expansion but a specific one dealing with the relationship between haloes and mass. 
As shown by Catelan et al. (1998) [see their equation (48)], the fact that the linear bias calculated according to the 
Mo & White formula is high implies that the dominant contribution comes from halo masses much larger than M* 
at the relevant epoch. This, in turn, means that the mass field smoothed on that scale is very close to linear. In such 
a case the Mo & White result becomes more and more accurate and higher-order corrections become negligible. So 
while corrections to the linear bias formula are certainly possible, they are not necessarily required simply because the 
bias is large. In any case, the observed correlation functions do tend to be steeper than the theoretical predictions at 
small separations, especially in Fig. 9a, so this might well be connected with these effects and should not necessarily 
lead one to argue that none of the models we present is compatible with the data. A non-linear bias may also play a 
role in the behaviour the Lyman-break galaxies. 

It is also quite possible for the bias to be even more complicated than this. In particular, it may be of non-local 
form so that the propensity of a galaxy to form at a particular position depends not only on the density at that point, 
but on the density at surrounding points. Such a non-local bias may be induced by astrophysical effects resulting 
in some kind of feedback (e.g. Babul & White 1991; Bower et al. 1993). A non-local bias is also induced purely 
dynamically, because haloes remember the conditions at their Lagrangian birthplace (Catelan et al. 1998). 

It is also worth mentioning the somewhat surprising fact that the differences in predictions of the cosmological 
models considered, while they are significant, are not perhaps as large as one would naively imagine. In particular, one 
might have expected the ACDM model and OCDM model to display the biggest differences because the linear growth 
law is so different in these cases. One can see, however, that when non-linear and bias evolution are incorporated, 
these models make predictions for most of the observational setups that are not drastically out of line with the other 
scenarios considered. 

Finally, in this section we remind the reader that the correlation amplitudes measured by observers in our theo- 
retical universes would be even larger than the quantities we have presented because of the effect of the amplification 
bias introduced by gravitational lensing. Since most of the failed models are excluded because they overpredict the 
strength of clustering anyway, this only reinforces our conclusions. 



6 DISCUSSION AND CONCLUSIONS 

In this paper we have explored a number of issues arising from the confrontation of observational evidence of high- 
redshift clustering against observations. We have stressed the importance of constructing exact statistical descriptions 
of clustering so that this confrontation can be carried out in an objective and accurate way. The calculation of the 
statistical quantities required to test particular models is not trivial because it demands the inclusion of a number of 
different effects but, as we have shown, this can be done when all relevant aspects are modelled systematically. 

As in Paper I, we have stressed the crucial importance of understanding more clearly the relationship between 
galaxies and mass, and how this relationship evolves with cosmic epoch. Even the simplest plausible models of bias 
introduce large uncertainties into the clustering pattern predicted in different theories. We also emphasise that these 
biasing schemes are probably over-simplifications: the bias may well be non-local and/or scale-dependent and may 
involve significantly more astrophysics than we have included in our discussion. It is a first priority to understand 
much better the relationship between galaxies and the underlying matter distribution, particularly the relationship 
between galaxy properties and those of the parent haloes. Some progress is clearly being made in this area by the 
application of phenomenological models of hierarchical galaxy formation (e.g. Kauffmanm, Nusser & Steinmetz 1997). 
Ultimately, however, the way forward will probably involve all-inclusive numerical simulations that can handle gravity, 
hydrodynamics and star formation simultaneously. On the other hand, it is reassuring that even the relatively small 
data sets available to us have allowed sizeable chunks of the parameter space of these models to be eliminated. In the 
meantime, we can be reasonably confident that further data will lead to stronger constraints on the simple models 
available at present. 

We have compared some currently fashionable models of structure formation with the available observational 
data using the statistical tools mentioned in the previous paragraph. The present data have fairly large experimental 
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errors, but do offer significant power to discriminate between models. This situation can only improve as more and 
better high-redshift data are accumulated. In particular we found Hubble Deep Field (HDF) data to be highly 
discriminatory. More data of this type, such as is anticipated from the proposed further deep surveys with HST, 
would be extremely useful. 

The details of the comparison of observations with data are discussed in the previous Section, but it is worth 
emphasizing a few general inferences that can be drawn by taking all the results together. First, we can conclude that 
if the 'correct' model of large-scale structure is indeed one of those we have discussed here, then both the rapid merging 
and galaxy-conserving models of galaxy formation are excluded by the data. The second point is that those cosmologies 
that can reproduce the observed abundance of rich clusters can also match the galaxy clustering observations, but 
only if galaxies are no more than moderately biased at redshifts of order unity. Since the models we are testing 
involved a complicated interplay of the various components (background cosmology, perturbation spectrum, biasing 
scheme, etc.) it is difficult to draw deeper conclusions from the data about any one of these components. In particular, 
one might have hoped that the rate of evolution of galaxy clustering might lead one more-or-less directly to the value 
of the density parameter, Q. Although the available data show no strong preference for either high or low values of 
Q,, the OCDM model does seem to fit both amplitude and shape of the available marginally better than models with 
a higher density; this is particularly so at relatively low redshifts. On the other hand, the data do generally prefer 
a value of the bias parameter of order unity. A low value for the bias parameter of bright galaxies tends, on other 
grounds, to favour Q < 1 (e.g. Peacock 1997; Coles & Ellis 1997). 

Finally, we stress that constraints emerging from clustering arguments, like those we have presented here, are 
significantly more robust than those based solely on number-densities, which are very sensitively dependent on 
assumptions about the halo parameters and galaxy formation efficiency. Quantities based on overdensities, such as 
£(r), are constructed to be independent of the underlying number-density of objects and one can, at least in principle, 
use them to make reliable predictions even when the predicted number-density of objects is uncertain. For this reason, 
we expect many useful constraints on models to derive from ongoing and planned observational surveys. 
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